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Abstract 

Background: Long noncoding RNAs (IncRNAs) have been shown to be involved in the development and progression of 
lung cancer. However, the roles of IncRNAs in lung cancer are not well understood. 

Methodology/Principal Findings: We used a high-throughput microarray to compare the IncRNA and messenger RNA 
(mRNA) expression profiles in lung adenocarcinoma and normal tissue (NT) samples. Several candidate adenocarcinoma- 
associated IncRNAs were verified by real-time quantitative reverse transcription polymerase chain reaction (PCR) analysis. 
Using abundant and varied probes, we were able to assess 30,586 IncRNAs and 26,109 mRNAs in our microarray. We found 
that 2,420 IncRNAs and 1,109 mRNAs were differentially expressed (>2-fold change) in lung adenocarcinoma samples and 
NT, indicating that many IncRNAs were significantly upregulated or downregulated in lung adenocarcinoma. We also found, 
via quantitative PCR, that 19 IncRNAs were aberrantly expressed in lung adenocarcinoma compared with matched 
histologically normal lung tissues. Among these, LOCI 001 32354 and RPLP0P2 were the most aberrantly expressed IncRNAs, 
as estimated by quantitative PCR in 100 pairs of lung adenocarcinoma and NT samples. 

Conclusions/Significance: Our study ascertained the expression patterns of IncRNAs in lung adenocarcinoma by microarray. 
The results revealed that many IncRNAs were differentially expressed in lung adenocarcinoma tissues and NT, suggesting 
that they may play a key role in tumor development. 
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Introduction 

Lung cancer has the highest mortality rate of all cancers, and its 
incidence is gradually growing [1]. Non-small cell lung cancer 
(NSCLC) is the most common type of lung cancer (accounting for 
80% of all cases) and includes squamous cell carcinoma, 
adenocarcinoma, and large cell carcinoma. Even though surgical 
resection, radiation therapy, and chemotherapy technologies are 
continuously improving, patients with lung cancer remain 
exceedingly vulnerable to relapse and death [2]. The cure rate 
for lung cancer is low and the average 5-year survival is less than 
15% [3-6]. In recent years, a growing proportion of lung 
adenocarcinomas have been diagnosed as NSCLC that are 
attributable to causes such as environmental pollution. However, 
the mechanisms underlying lung cancer development have not 
been elucidated; thus, the study of lung cancer remains extremely 
important. 

Long noncoding RNAs (IncRNAs) are transcript RNA mole- 
cules longer than 200 nucleotides that do not encode a protein and 



reside in the nucleus or cytoplasm [7]. LncRNAs are usually 
divided into five categories: sense, antisense, bidirectional, 
intronic, and intergenic. In recent years, a large number of 
IncRNAs have been identified, prompting the creation of the 
human IncRNA database, which provides IncRNA expression 
profiles and other important information [8], The abnormal 
expression of IncRNAs has been implicated in a range of diseases, 
including cancer [9] . 

Studies have shown that IncRNAs known to be difFerentially 
expressed in normal cells and tumor cells are important for the 
regulation of gene expression; therefore, aberrant expression of 
IncRNAs can lead to abnormalities of gene expression and 
tumorigenesis. The altered expression of IncRNAs is a feature of 
many types of cancers and has been shown to promote the 
development, invasion, and metastasis of tumors by a variety of 
mechanisms [9,10]. LncRNAs regulate expression on the epige- 
netic, transcriptional, and post-transcriptional levels [11-13]. 

LncRNAs have been shown to be involved in the development 
and progression of lung cancer; however, research concerning 
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IncRNA involvement in lung cancer is in its infancy. Known lung 
cancer-associated IncRNAs are few and include HOTAIR, HI 9, 
ANRIL, MALAT1 (lung adenocarcinoma associated transcript 1) 
[14,15], SCAL1 (smoke and cancer-related long-chain non-coding 
RNA1) [16], IncRNA AK126698 [17], and IncRNA GAS6-AS1 
(GAS6 antisense RNA1) [18]. The identification of additional lung 
cancer-associated IncRNAs and their mechanism of action require 
further investigation. 

In this study, we analyzed the expression patterns of IncRNAs 
and mRNAs in lung adenocarcinoma samples and compared 
them with the corresponding patterns in adjacent nontumorous 
tissue (NT) samples. Several of the differentially expressed 
IncRNAs were evaluated by SYBR RT-PCR in 100 pairs of 
tissue samples. Our results suggest that IncRNA expression 
patterns may provide new molecular biomarkers for the diagnosis 
of lung adenocarcinoma. 

Materials and Methods 

Patient samples 

The lung adenocarcinoma samples and corresponding NT 
samples were prospectively collected from 105 patients of the First 
Affiliated Hospital of Wenzhou Medical University, China, from 
April 2012 to August 2013. Of these patients, five (the basic 
medical records see Table SI in File SI) were used for microarray 
analysis of IncRNAs and 1 00 were used for additional evaluations 
(Table S2 in File SI). The diagnosis of adenocarcinoma was 
confirmed by histopathology. The lung adenocarcinoma and 
matched NT samples were snap-frozen in liquid nitrogen 
immediately after resection. This study was approved by the 
Institutional Ethics Review Board of the First Affiliated Hospital of 
Wenzhou Medical University, and all patients provided written 
informed consent for this study. 



RNA extraction 

Lung adenocarcinoma cells were obtained by laser microdis- 
section; the proportion of cancer cells in the tissue sections was 
100%. We combined the five lung adenocarcinoma samples and 
the five corresponding NT normal samples, and the two grouped 
samples were subjected to RNA extraction. Total RNA was 
extracted using Trizol reagent (Invitrogen, Carlsbad, CA, USA), 
according to the manufacturer's protocol. The integrity of the 
RNA was assessed by electrophoresis on a denaturing agarose gel. 
A NanoDrop ND-1000 spectrophotometer was used for the 
accurate measurement of RNA concentration (ODggo), protein 
contamination (OD260/OD280 ratio), and organic compound 
contamination (OD 2 bo/OD 2 3o ratio). 

Microarray and computational analysis 

For microarray analysis, an Agilent Array platform (Agilent 
Technologies, Santa Clara, CA, USA) was employed. Sample 
preparation and microarray hybridization were performed based 
on the manufacturer's standard protocols with minor modifica- 
tions. Briefly, mRNA was purified from total RNA after removal 
of rRNA by using an mRNA-ONLY Eukaryotic mRNA Isolation 
Kit (Epicentre Biotechnologies, USA). Then, each sample was 
amplified and transcribed into fluorescent cRNA along the entire 
length of the transcripts without 3' bias by using a random priming 
method. The labeled cRNAs were hybridized onto a Human 
LncRNA Array v3.0 (8 x60 K; Arraystar), which was designed for 
30,586 IncRNAs and 26,109 coding transcripts. The IncRNAs 
were carefully constructed using the most highly respected public 
transcriptome databases (RefSeq, UCSC Known Genes, GEN- 
CODE, etc.) as well as landmark publications. Each transcript was 
accurately identified by a specific exon or splice junction probe. 
Positive probes for housekeeping genes and negative probes were 
also printed onto the array for hybridization quality control. After 
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Figure 1. Box plots and Scatter plots showing the variation in IncRNA and mRNA expression between the lung adenocarcinoma 
and normal lung tissue arrays. (A-B) Box plots showing the distribution of the IncRNA (A) and mRNA (B) profiles. After normalization, the 
distributions of the log2-ratios among the tested samples were nearly the same. (C-D) Scatter plots showing the variation in IncRNA (C) and mRNA 
(D) expression between the lung adenocarcinoma and normal lung tissue arrays. The values of the X and Y axes in the scatter plot are averaged 
normalized values in each group (log2-scaled). The IncRNAs above the top green line and below the bottom green line are those with a >3-fold 
change in expression between tissues. 
doi:1 0.1 371 /journal.pone.01 04044.g001 
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Figure 2. Heat map and hierarchical clustering of IncRNA and mRNA profile comparison between the lung adenocarcinoma and 
normal lung samples. (A) IncRNA (B) mRNA. 
doi:1 0.1 371 /journal.pone.01 04044.g002 



washing the slides, the arrays were scanned using an Agilent 
G2505C scanner, and the acquired array images were analyzed 
with Agilent Feature Extraction software (version 11.0.1.1). 
Quantile normalization and subsequent data processing were 
performed using the GeneSpring GX vl2.0 software package 



(Agilent Technologies). The microarray work was performed by 
KangChen Bio-tech, Shanghai, People's Republic of China. 

Functional group analysis 

GO analysis was derived from Gene Ontology (www. 
geneontology.org), which provides three structured networks of 
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Figure 3. Pathway analysis of upregulated mRNAs in lung adenocarcinoma. Seven upregulated pathways were identified, including 
ethanol metabolism, viral carcinogenesis, RNA transduction, and cell cycle pathways. 
doi:1 0.1 371 /journal.pone.01 04044.g003 
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Figure 4. Pathway analysis of downregulated mRNAs in lung adenocarcinoma. Twenty-four downregulated pathways were identified, 

including propionate metabolism and fatty acid metabolism pathways. 

doi:10.1371/journal.pone.0104044.g004 



defined terms that describe gene product attributes. The P-value 
denotes the significance of GO Term enrichment in the 
differentially expressed mRNA list (P^O.05 was considered 
statistically significant). We also performed pathway analysis for 
the differentially expressed mRNAs based on the latest KEGG 
(Kyoto Encyclopedia of Genes and Genomes) database. This 
analysis allowed us to determine the biological pathways for which 
a significant enrichment of differentially expressed mRNAs existed 
(P^0.05 was considered statistically significant). 



cients (Pearson correlation coefficient, PCC) was calculated 
between the coding and non-coding/ coding and coding with the 
use of R statistical analysis. PCC> = 0.98 was considered 
statistically significant in both cases. The CNC (coding-non- 
coding)/CC (coding-coding) network was drawn with the 
Cytoscape (v2.8.1) software, which green nodes represent coding 
gene (mRNA), red nodes represent non-coding (IncRNA). The 
solid line represents a positive correlation between the two nodes, 
the dotted line represents a negative correlation. 



Construction of the Coding-non-coding Gene 
Coexpression Network 

The weighted coexpression network is constructed by calculat- 
ing a pairwise correlation matrix between all probe sets across 
microarray samples. The resulting Pearson correlation matrix was 
transformed into an adjacency matrix. Microarray data can be 
noisy, and the number of samples is often small, so we weighed the 
Pearson. 

correlations by taking their absolute value and raising them to 
the power. The nodes of coexpression network correspond to gene 
expressions, and edges between genes are determined by the 
correlations between gene expression Pearson correlation coeffi- 



Quantitative PCR 

Total RNA was extracted from frozen lung adenocarcinoma 
tissues by using TRIzol reagent (Invitrogen) and then reverse- 
transcribed using an RT Reagent Kit (Thermo Scientific), 
according to the manufacturer's instructions. LncRNA expression 
in lung adenocarcinoma tissues was measured by quantitative 
PCR by using SYBR Premix Ex Taq and an ABI 7000 
instrument. Some candidate IncRNAs were validated by SYBRP 
PCR, these gene primers in the study for Q-PCR see Table S3 in 
File SI. Among these, two IncRNAs that were significandy 
differentially expressed in adenocarcinoma and normal tissues 
(LOCI 00 132354, RPLP0P2) were evaluated in all of the patients 
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Figure 5. Comparison between gene chip data and qPCR result. RP11-1C1.7, RP4-575N6.5, RP1 1-473M20.1 1, XLOC_003286, CTA-363E6.2, 
RP5-826L7.1, ZNF295-AS1, RPLP0P2, AC079776.2, RP13-514E23.1, LOC1 00499405, RP1-90D4.3, XLOC_012255, RP1 1-445K13.2, LOC100132354, 
AC004166.7, XLOC_003405, RP1 1-893F2.6, RP1 1-909N17.3 determined to be differentially expressed in lung adenocarcinoma samples compared with 
NT samples in six patients by microarray were validated by qPCR. The heights of the columns in the chart represent the log-transformed median fold 
changes (T/N) in expression across the six patients for each of the four IncRNAs validated; the bars represent standard errors. The validation results of 
the 19 IncRNAs indicated that the microarray data correlated well with the qPCR results. 
doi:10.1371/journal.pone.0104044.g005 
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Figure 6. The expression level of RPLPOP2 and LOCI 001 32354 
between one hundred lung adenocarcinoma sample and the 
normal lung sample. L0C1 001 32354 expression of lung adenocar- 
cinoma was significantly higher than the adjacent tissues (Mann- 
Whitney U = 126.00, P = 0.01), while RPLP0P2 expression of lung 
adenocarcinoma was significantly lower than the adjacent tissues 
(Mann-Whitney U = 178.78, P = 0.002). 
doi:1 0.1 371 /journal.pone.01 04044.g006 

included in this study. Total RNA (2 mg) was transcribed to 
cDNA. PCR was performed in a total reaction volume of 20 
including 10 "J of SYBR Premix (2x), 2 (J.1 of cDNA template, 
1 JLll of PCR forward primer (10 mM), 1 jH.1 of PCR reverse primer 
(10 mM), and 6 (_Ll of double-distilled water. The quantitative real- 
time PCR reaction included an initial denaturation step of 1 0 min 
at 95°C; 40 cycles of 5 s at 95°C, 30 s at 60°C; and a final 
extension step of 5 min at 72°C. All experiments were performed 
in triplicate, and all samples were normalized to GAPDH. The 
median in each triplicate was used to calculate relative IncRNA 
concentrations (ACt = Ct median IncRNA — Ct median 
GAPDH), and the fold changes in expression were calculated [19]. 

Statistical methods 

All results are expressed as mean ± standard deviation. 
Statistical analysis was performed for the comparison of two 
groups in the microarray, and analysis of variance for multiple 
comparisons was performed using the Student's i-test. Differences 
with P<0.05 were considered statistically significant in both cases. 
The fold change and the Student's i-test were used to analyze the 
statistical significance of the microarray results. The false discovery 
rate (FDR) was calculated to correct the P- value. The threshold 
value used to designate differentially expressed IncRNAs and 
mRNAs was a fold change of >2.0 or <0.5 (P<0.05). 

Results 

Overview of IncRNA profiles 

To study the potential biological functions of IncRNAs in lung 
adenocarcinoma, we examined the IncRNA and mRNA expres- 
sion profiles in human lung adenocarcinoma through microarray 
analysis (Fig. 1). For this analysis, authoritative data sources 
containing more than 30,586 IncRNAs were used. The expression 
profiles of 2,420 IncRNAs indicated that they were differentially 
expressed (fold change ^2.0 or <0.5; P<0.05) between lung 
adenocarcinoma and normal lung samples. Among these, 1,213 
IncRNAs were found to be upregulated more than two-fold in the 
lung adenocarcinoma group compared to the normal lung group, 
while 1,207 IncRNAs were downregulated more than two-fold 
(P<0.05; Table S4, S5 in File SI and Fig. 1). 



LncRNA classification and subgroup analysis 

The expression profiles of 564 intergenic IncRNAs indicated 
that they were differentially expressed (fold change S2.0, P<0.05) 
between lung adenocarcinoma and normal lung samples. Among 
these, 338 were upregulated and 226 were downregulated. We 
also identified some nearby coding genes that may be regulated by 
these IncRNAs (Table S6 in File SI). LncRNAs with enhancer-like 
functions (IncRNA-a) were identified using GENCODE annota- 
tion. The expression profiles of 77 enhancer-like IncRNAs 
indicated that they were differentially expressed (fold change & 
2.0, P<0.05) between lung adenocarcinoma and normal lung 
samples. Among these, 43 were upregulated and 34 were 
downregulated. We also identified some nearby coding genes that 
may be regulated by these enhancer-like IncRNAs (Table S7 in 
File SI). HOX cluster profiling: This data table contains 83 HOX 
clusters (Table S8 in File SI). 

Overview of mRNA profiles 

In total, 1,109 mRNAs were found to be differentially expressed 
between the lung adenocarcinoma and normal lung samples, 
including 278 that were upregulated and 831 that were 
downregulated (Tabel S9-S10 in File SI, Figs. 1-2). 

GO analysis 

The genes corresponding to the downregulated mRNAs 
included 357 genes involved in biological processes, 98 genes 
involved in cellular components and 96 genes involved in 
molecular functions. The genes corresponding to the upregulated 
mRNAs included 244 genes involved in biological processes, 80 
genes involved in cellular components, and 69 genes involved in 
molecular functions(Tabel S 1 1 — S 1 2 in File SI). 

Pathway analysis 

Seven upregulated pathways were identified, including ethanol 
metabolism, viral carcinogenesis, RNA transduction, and cell cycle 
pathways. Twenty-four downregulated pathways were identified, 
including propionate metabolism and fatty acid metabolism 
pathways (Tables S13 in File SI and figure 

LncRNA-mRNA co-expression network 

We build the LncRNA-mRNA co-expression network of four 
important IncRNAs, including RP 1 1 - 1 C 1 . 7 (ENST000004 1 6930), 
RPLP0P2 (ENST00000492786), LOCI 00 132354 (NR_024478), 
RP11-909N17.3 (ENST000005 19762), See Table S14 in File SI. 

Real-time quantitative PCR validation 

Based on the features of the differentially expressed IncRNAs, 
such as fold difference, gene locus, and nearby encoding gene, we 
initially identified a number of interesting candidate IncRNAs for 
further analysis (including RP11-1C1.7, RP4-575N6.5, RP11- 
473M20.11, XLOC_003286, CTA-363E6.2, RP5-826L7.1, 
ZNF295-AS1, RPLP0P2, AC079776.2, RP13-514E23.1, 
LOC 100499405, RP11-90D4.3, XLOC_012255, RP11- 
445K13.2, LOC100132354, AC004166.7, XLOC_003405, 
RP11-893F2.6, and RP1 1-909N1 7.3). We verified the expression 
of these IncRNAs by real-time quantitative RT-PCR by using 
GAPDH as a reference gene and by calculating the 2 _AACT 
values. We found that the microarray results for several of the 
IncRNAs were consistent with the results of RT-PCR (Fig. 5). Of 
these, LOC 100 132354 and RPLP0P2 exhibited the most signif- 
icantly changed expression in our analysis of 100 pairs of lung 
adenocarcinoma and normal lung tissue samples. The expression 
of LOC100132354 was significantly higher in lung adenocarcino- 
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ma than in the adjacent tissues (Mann- Whitney U= 126.00, 
P = 0.01), while the expression of RPLP0P2 was significantly lower 
in lung adenocarcinoma than in the adjacent tissues (Mann- 
Whitney U= 178.78, P = 0.002) (Fig. 6). 

Discussion 

According to the 2012 China Oncology Annual Report, in 
2009, the incidence and mortality of lung cancer was the highest 
among cancers in male patients and the second highest among 
cancers in female patients in China. Lung adenocarcinoma is a 
type of NSCLC that has rising incidence rates in women and non- 
smokers. However, the pathogenesis of lung cancer remains 
unclear; therefore, further study of lung cancer is of great 
importance. 

LncRNAs play an important role in many biological processes, 
including X-chromosome inactivation, gene imprinting, and stem 
cell maintenance [20,21]. Furthermore, IncRNAs are important 
factors in the control of gene expression in cancer [22], and 
IncRNAs such as HOTAIR have been shown to play an important 
role in the development and progression of tumors [9] . It has also 
been demonstrated that IncRNAs are differentially expressed in 
normal cells and tumor cells. As IncRNAs constitute an important 
class of gene expression regulatory factors, their aberrant 
expression would inevitably lead to abnormal gene expression 
levels, which may result in tumorigenesis. Promoters bind to many 
transcription factors by mechanisms such as chromosomal 
rearrangements and transfer elements [23]. 

In this study, we analyzed IncRNA expression profiles in the 
tissues of lung adenocarcinoma patients to uncover the potential 
role of IncRNAs in the pathogenesis of this disease. High- 
throughput microarray techniques revealed a set of differentially 
expressed IncRNAs, including 1,213 that were upregulated and 
1,207 that were downregulated in lung adenocarcinoma tissue 
compared to normal lung tissue. LncRNAs are usually divided into 
five categories: sense, antisense, bidirectional, intronic, and 
intergenic. LncRNAs are known to function via a variety of 
mechanisms; however, a common and important function of 
IncRNAs is to alter the expression of nearby encoding genes by 
affecting the process of transcription [24] or direcdy playing an 
enhancer-like role [25,26]. In the present study, we increased the 
accuracy of target prediction by comparing differentially expressed 
mRNAs with differentially expressed IncRNAs. The IncRNA 
expression profiles indicated that 564 IncRNAs were differentially 
expressed (338 upregulated and 226 downregulated) between lung 
adenocarcinoma and normal lung samples. The expression 
profiles included 77 differentially expressed enhancer-like 
IncRNAs, with 43 upregulated and 34 downregulated. We also 
identified some nearby coding genes that may be regulated by 
IncRNAs and enhancer-like IncRNAs. Moreover, we also 
performed HOX cluster profiling of IncRNAs and coding 
transcripts. 

Otherwise,we build the LncRNA-mRNA co-expression network 
and we analyzed four important IncRNAs, including RP1 1-1 CI. 7, 
RPLP0P2, LOC100132354, RP1 1-909N17.3. It shown that 
RP11-1C1.7, RPLP0P2, LOC100132354, RP1 1-909N17.3 were 
individually relation to some mRNAs. 
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